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METHOD FOR ASSEMBLY OF MULTIPLE DNA FRAGMENTS 
Field Of The Invpntinti 

The present invention relates to the field of genetic engineering. More 
particularly, the invention relates to a novel method for assembling two or 
more DNA fragments with high efficiency and yield. 

Background of the Invention 

Building DNA constructs is the basis of genetic engineering. Manipulating 
recombinant DNA and its incorporation into vectors is common knowledge 
in the art. For example, fragments obtained by PCR techniques, may be 
incorporated into plasmids for cloning purposes. Generating specific DNA 
constructs by joining together different DNA segments, each encoding 
specific properties and/or possessing specific functions, is also known in the 
art. The state of the art has been published in a very large number of 
books, articles, patent applications, patents and the like, and is usually 
readily available and known to all of skill in the art. For example, a 
comprehensive account of DNA cloning procedures is provided in the three 
volume text by Sambrook ei al. (1989) entitled "Molecular Cloning-a 
Laboratory Manual" 2nd edition, Cold Spring Harbor Laboratory Press. 
This extensive account of the prior art techniques for the combination of 
DNA fragments, cloning and expression thereof, is included herein by 
reference, in its entirety. 

According to the present state of the art, the DNA fragments comprising 
the plasmid to be constructed are cleaved by restriction enzymes from 
larger entities. Next, they are connected utilizing the enzyme Ligase, 
which forms phosphodiester bonds between DNA fragments. Restriction 
enzymes cleave DNA leaving blunt or staggered ends (protruding short 
single stranded termini) also called overhangs. The efficiency of the 
ligation of blunt ended fragments is very low. Using staggered ended 
fragments elevates the efficiency of the ligation process, because the two 
fragments are being held together by the hydrogen-bonds between the 
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complementary overhangs. Because the overhangs produced by most of the 
restriction enzymes are very short (2-4 bases), the connections between the 
complementary overhangs are weak and not very specific. Thus, when 
constructing a plasmid from restriction enzyme produced fragments, the 
yield is very low and there are many illegitimate ligations, leading to 
undesired products. It is therefore necessary to amplify the product usually 
by transfecting cells of choice such as bacterial cells, and then to identify 
and isolate colonies containing the DNA of choice from the ones containing 
undesired products. Only then can another DNA fragment be added by 
repetition of the whole procedure. As a result of this inefficient processes 
the building of complex DNA constructs is a laborious, time consuming and 
expensive process, in which each step necessitates the successful 
completion of the former. The construction of complex molecules may take 
several weeks to several months. Sometimes the completion of such 
constructs is not achieved at all. Another major drawback of the above 
mentioned method is that the availability of restriction site does not 
always coincide the construction demands. One way to overcome the above 
mentioned problems is to use short artificial DNA molecules called 
"linkers". This however further complicates the construction process, 
reduces the yield, increases the percentage of wrong constructs and 
sometimes add undesired foreign sequences. 

Several methods suggest the possibility of connecting DNA fragments by 
producing longer complementary overhangs (8-14 nucleotides). These 
connections are more stable than the ones produced by restriction 
enzymes. In fact, they are stable enough to render the ligation step 
unnecessary and the bacterial cells may be transformed right after the 
fragments are connected. The phosphodiester bonds are generated later, by 
the endogenous bacterial ligation machinery. 

One prior method uses single strand extensions that are created by adding 
nucleotides at the 3' end of a DNA strand in a template-independent 
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fashion (Roychoudhury, R. Gene Amplif. Anal. 2:41-83, 1981). The enzyme 
used in this method, Terminal Transferase, incorporates nucleotides at the 
end of a double-stranded DNA fragment, thus creating a single-stranded 
tail. Since the enzyme uses the nucleotides randomly, the only way to 
ensure that the single-stranded tail will be complementary to a 
corresponding overhang created on a second DNA molecule, is to provide 
for each extension only one of the four nucleotides. The overhangs created 
with this method must therefore be homopolymeric, so that only four types 
of overhangs can be used, corresponding to the residues dA, dC, dG or dT. 
Since the overhangs created on both termini of a DNA fragment must be 
identical, cloning with this method is directionless and can only involve 
two fragments that are connected to each other at both ends, forming a 
circular molecule. Furthermore, the length of the overhangs cannot be 
specifically controlled. Finally, the method necessarily introduces an 
unwanted stretch of nucleotides into the final construct, the length of 
which cannot be determined exactly, making the method unsuitable for the 
purpose of cloning into expression vectors where the reading frame must 
be preserved. 



According to another method (the commercial product "PCR-Direct™", 
manufactured by CLONTECH, Inc., USA) the overhangs are generated 
utilizing the exonuclease activity of a DNA polymerase. 

U.S. Patent No. 5,137,814 describes another method in which the overhang 
is generated by providing at least one dU residue instead of dT, close to the 
terminus of the fragment. The position of the dU residues determines the 
length of the overhang. This method involves an a-purination of the Uracil 
bases. The a-purinated residues no longer have hydrogen-bond connections 
with their complementary bases on the opposite strand. Moreover, they 
destabilize the hydrogen bonds of their neighboring bases as well. The 
resulting 3' protruding termini may connect to a complementary single 
strand sequence. A commercial product using this method is the 



WO 98/38298 



PCT/IL98/00096 



• 4- 

CloneAmp® pAMPl System (manufactured by Life Technologies, Inc., 
USA). In the aforesaid method a 12-base overhang is used. 

The use of overhangs longer than 4 nucleotides for the purpose of fragment 
cloning is also described in several other publications. Rashtchian et al. 
(Anal. Biochem. 206, p. 91-97, 1992), describe 12 nucleotide overhangs 
generated by Uracil DNA Glycosylase (UDG) to achieve high-efficiency 
cloning of single inserts into a vector. Kuijper et al. (Gene 112, p. 147-155, 
1992) and Aslanidis et al. (PCR Methods Appl. 4:172-177, 1994), describe a 
cloning method wherein T4 polymerase is used together with a 
predetermined dNTP to generate overhangs of a certain length in PCR 
products. This method requires a specific sequence to be present in the 
PCR primer. Hsiao et al. (Nucleic Acids Res. 21, p. 5528-5529, 1993) and 
Yang et al. (Nucleic Acids Res. 21, 1889-1893, 1993) disclose generation of 
overhangs by the exonucleolytic activity of Exonuclease III (Exo III) or of 
T4 polymerase. Overhangs of 12 (Aslanidis), 8 (Yang) and 10-14 (Hsiao) 
nucleotides are disclosed. 

The probability of producing a joint molecule composed from three 
molecules is the probability of joining of the first two molecules, multiplied 
by the probability of joining of the second and third molecules. If the 
joining of any two molecule is a rare event, due to the relatively short 
overhang, the joining of three or more molecules becomes practically 
useless for cloning purposes, because of the resulting low efficiency 

It is therefore clear that the present state of the art is not satisfactory and 
there is a need for an improved method by which several DNA fragments 
can be effectively joined together in a directional, predetermined way, in 
one single-step. Such a method will remove the limitations imposed on 
genetic engineering by the aforementioned methods, and will thus 
revolutionize the way in which genetic engineering is performed. 
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SUMMARY OF THE INVENTION 

In one aspect, the invention is directed to a method for assembling two or 
more DNA fragments with high efficiency, comprising: 

a) providing, for each DNA fragment, at least one protruding 
terminus, or "overhang", capable of hydrogen bonding to a complementary 
sequence on at least one strand of a second DNA fragment, said overhang 
having at least 15 bases; and 

b) mixing two or more said DNA fragments under conditions 
suitable to promote joining thereof. 

The method of the invention is based on the very surprising finding that 
increasing the number of bases in complementary overhang from 12 to at 
least 15, permits to reduce the ratio of the reagents from about 1:100 - 300 
to about 1:1. At the same time, this permits to join multiple fragments (3 
or more), because of the high efficiency of the process. 

According to one embodiment of the invention three or more fragments are 
joined by the method of the invention, and the molar ratio of each DNA 
pairs is about 1:1 to 1:50. 

The present invention provides a DNA fragment comprising an overhang 
of at least 15 nucleotides or an end portion suitable to be converted into 
such an overhang. The invention further provides said DNA fragment, for 
use in the above method of the invention. 

According to another preferred embodiment of the invention, the number 
of bases in the overhang is between 20 and 30. While, as stated above, the 
improvement of the process efficacy is very dramatic when passing from 
12 to 15 bases, there is still a further steep improvement in the efficacy, 
when increasing the number of bases to about 20. This increase reaches a 
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plateau above 20 bases. Taking into account that, when joining a number 
of fragments, the overall efficiency of the process decreases, it is of course 
desirable to employ the lowest number of bases in the overhang which still 
provides for increased joining efficiency. As stated, in most cases this 
optimal length will be in the neighborhood of 20 bases. 

As will be appreciated by the skilled person, this is most surprising 
because such a dramatic increase in binding ability of fragments (from 
almost no binding when overhangs are 12 bases long to almost 100% 
binding when the overhangs are 21 bases long) was unexpectable. 
Although some improvement could have been expected by increasing the 
length of the overhang, such a dramatic improvement (two orders of 
magnitude) was entirely surprising. The art, in fact, has continued to use 
stepwise cloning techniques, with all the inherent disadvantages, and has 
not attempted in practice to use overhangs longer than 12 bases. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. 1-6 illustrate plasmids produced in Example 3. In the Figs. 1- 
6, the dark, cross-hatched bar depicts the fragment amplified from 
pBR322 which carries the Amp r gene and the origin of replication. The 
empty bar depicts the fragment PCR amplified from pBR322 which carries 
the Tet 1 gene. The hatched bar depicts the PCR amplified fragment from 
pACYCl84 which carries the Cm r gene. The arrows mark primers, where 
the arrow head is the 3' end of the primer, and the wavy tails mark the 
various repetitive sequences at the 5' end of the primers. The primer's 
number is given above or below the arrow. The black area denotes 
repetitive sequences situated at the connection between two fragments. 
Fig. 12 illustrates the plasmid produced in example 4. The empty, cross- 
hatched and hatched bars have the above meanings. The arrows mark the 
primers which are used for the seamless junction of the fragments. 
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Fig. 1 is plasmid pGSlOl. There is an overlap of 12bp between 
every two fragments which are joined together. The 
plasmid is assembled from three fragments. 

Fig. 2 is plasmid pGSl02. There is an overlap of 12bp between 
every two fragments which are joined together. The 
plasmid is assembled from four fragments. 

Fig. 3 is plasmid pGS103. There is an overlap of 12bp between 
every two fragments which are joined together. The 
plasmid is assembled from four fragments. 

Fig. 4 is plasmid pGS201. There is an overlap of 20-21bp 
between every two fragments which are joined together. 
The plasmid is assembled from three fragments. 

Fig. 5 is plasmid pGS202. There is an overlap of 20-21bp 
between every two fragments which are joined together. 
The plasmid is assembled from four fragments. 

Fig. 6 is plasmid pGS203. There is an overlap of 20-21bp 
between every two fragments which are joined together. 
The plasmid is assembled from four fragments. 

Fig. 7 shows the results obtained by an Agarose gel run, to 
check fusion ability of identical fragments with different 
overhang lengths. 

Fig. 8 shows the colonies that grew on plates following 
transformation with constructs illustrated in Figures 1- 
6. The results of constructs made with overhangs of 12 
bases are shown on the left and those with overhangs of 



WO 98/38298 



PCT/IL98/00096 



-8- 

20-21 bases are shown on the right. Figs. 8 A, B, C, D, E, 
F, and G correspond to plasmids pGSlOl, pGS201, 
pGSl02, pGS202, pGSl03 and pGS203, respectively. 
The plating was carried out on selective plates that 
would minimize background noise as explained in the 
text. 

Fig. 9 illustrates the constructs made in Example 3, along with 
the number of transform ants obtained; 

Fig. 10 shows the results of two gel runs of the products 
obtained in Example 3; 

Fig. 11 shows the results of gel runs of the products obtained in 
Example 2; 

Fig. 12 shows a construct made by joining 3 different fragments 
using exonuclease Ill-created overhangs in the 
experiment of example 4; 

Fig. 13 shows a construct made by joining 8 different fragments 
in the experiment of example 5. 

DETAILED DESCRIPTION OF THE INVENTION 

General Procedures 

Overview 

The present invention is based on the surprising discovery that by 
increasing the length of complementary overhangs of two DNA fragments 
from 12 bases, as in the prior art, to 15 bases or more, the tendency of the 
fragments to be connected to each other rises, steeply increasing, from 
close to 0% when 12 base overhangs are used, to almost 100% when 
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around 20 bases are employed. When long complementary overhangs of 15 
bases or more are used, a large number of DNA fragments can be 
assembled in a single step in a predetermined, directional manner. In 
addition, when connections between just two fragments are made, the 
yield of cloned products is much higher than that of the prior art, when 
operating according to the invention. 

A number of procedures can be used in order to create these overhangs. In 
the examples detailed below, three different procedures were used. 

The first procedure 

The first procedure is used in examples 1-3. Examples 1 and 2 illustrate 
the difference in the connecting ability of fragments with complementary 
overhangs of different lengths in-vitro by subjecting the connected 
fragments to agarose gel analysis. In the third example, successful cloning 
of a plasmid assembled from five fragments using the procedure was 
demonstrated. The fragments were connected to each other in-vitro, and 
then transformed into bacterial cells. 

The procedure is as follows: 

The fragments which are to be attached to each other in a directional 
fashion are preferably prepared by utilizing the well-established 
Polymerase Chain Reaction (PCR). This provides suitable amounts of the 
fragments and is especially preferred when the fragments to be used are 
originally obtained or available only in small amounts, for example from 
restriction-endonuclease treated plasmid DNA, genomic DNA, or cDNA 
libraries in which the desired fragments are present in small amounts. In 
this way, the original fragments are greatly amplified and by virtue of the 
use of pre-selected specific primers, the ends of the PCR-amplified 
fragments will contain the desired pre-selected complementary sequences. 
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The primers are designed such that they consist of two regions: A 3' region 
that is complementary to the template and a 5* region that contains a 
sequence complementary to a sequence at the 5' end of the fragment it is 
designed to join. In this 5' region, several dT residues are substituted by 
dU residues. During PCR, the thermostable DNA Polymerase does not 
distinguish between the dT and dU residues and incorporates dA opposite 
both of these residues in the newly synthesized strand.. 

Following PCR, the fragments are mixed together and the enzyme Uracyl 
DNA Glycosylase (UDG) is added to the reaction mixture. UDG 
specifically attacks the dU bases and a-purinates them. The a-purinated 
residues no longer have hydrogen bonds with their complementary bases. 
Furthermore, they destabilize the hydrogen bonds of their neighboring 
bases as well. As a result, the double stranded structure near the terminus 
falls apart. Two open strands are created at the end of the segment: 1. A 5' 
single strand with several a-purinated residues that can no longer form 
hydrogen bonds with a complementary strand. 2. A 3' single strand with 
no a-purinated residues that can form such bonds. Although this single 
strand is not, strictly speaking, an overhang, it is equivalent to it and 
behaves as such for all practical purposes. The complementary regions of 
DNA generated in this procedure are made of repetitive sequences. 
Therefore, it can be used to illustrate that the length of the exposed ends 
is the key parameter in determining the tendency of fragments to be 
connected to one other: The difference between the constructs in the 
various experiments is in the lengths of the repetitive complementary 
regions and not in the sequence. 

Two fragments that have such complementary single strands can form 
hydrogen-bond connections with each other. If such fragments are used to 
transform bacterial cells, their 5' a-purinated single strands are degraded 
by cellular enzymes. Furthermore, endogenous ligases convert the joint 
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fragments into a single DNA molecule. Details of the procedure are 
described in Examples 1-3 below. 

The second procedure 

In Example 4, a second procedure was used. This procedure uses the 
enzyme Exonuclease III (herein after ExoIII) to generate the overhangs. 
Other enzymes having an exonuclease activity such as T4 DNA 
polymerase or T7 DNA polymerase may be used as well. In addition to the 
above mentioned enzymes which have 3'->5' exonuclease activity, enzymes 
with 5'->3' exonuclease activity may also be used, depending on the 
circumstances. Procedures similar to this one are disclosed in the above 
Hsiao et al. and Yang et al. The specific procedure is elaborated in a 
copending application of the same applicant herein (filed on the same day 
as this application, and identified as Attorney Docket 4190/96, Israeli 
Patent Application No. 120378). The procedure can be applied for the 
joining of PGR fragments as in examples 1-3 as well as for the joining of a 
mixture of DNA fragments that are generated by restriction enzyme 
cleavage together with PGR generated fragments. In example 4 the 
fragments were produced by PCR to generate a plasmid assembled from 
three segments. 

The procedure is as follows: 

The terminal portions of the fragments that are to be joined have to be 
complementary to each other. This can be achieved by producing the PCR 
fragments in the following way: The primers of a first DNA fragment that 
is to be PCR produced and is to be joined to a second and to a third 
fragment should be synthesized having two regions: 1. A 3' region that is 
complementary to the template. 2. A 5' region that is complementary to 
the terminal portion of the fragment that is to be joined to the first 
fragment. The length of the complementary regions should be 15 
nucleotides (nt) or more. 
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The various DNA fragments to be joined are mixed together and an 
exonuclease enzyme is added to the mixture. In the case of example 4, 
ExoIII was used. ExoIII is an exonuclease that catalyzes the stepwise 
removal of mononucleotides from 3' hydroxyl termini of double stranded 
DNA. It is added in access and the reaction is incubated at 6°C to ensure 
slow digestion. The reaction is allowed to proceed for a time sufficient to 
digest the ends of the fragments, exposing 5' overhangs. These overhangs 
usually consist of a region which is longer than the complementary 
portion. 

Once the reaction is stopped, the temperature is raised to 75°C. The 
reaction mixture is then cooled slowly. This facilitates the joining of 
complementary overhangs while minimizing illegitimate connections. 
Following the cooling, dNTPs and T7 DNA Polymerase are added in order 
to fill any gaps that may have been formed due to ExoIII overdigestion. At 
this stage, the mixture is used to transform bacterial cells. Further 
elaboration of the procedure is given in Example 4 below. 

The third procedure 

In example 5 a third procedure was used. This procedure is similar, in 
many respects to the first one but has the advantage that the junctions 
between the various fragments require no addition of linker nucleotides. 
An elaborated description of this procedure is disclosed in a copending 
application (filed on the same day as this application, and identified as 
Attorney Docket 4149/96, Israeli Patent Application No. 120377). As will 
be demonstrated below, a single dU residue near the termini of the 
fragments, along with UDG and the reagent N,N 
dimethylethylenediamine were employed in this procedure to generate 
seamless connections between fragments. In example 5, the cloning of a 
plasmid assembled from eight separate segments is demonstrated. To 
date, this is the only disclosure of an assembly of a plasmid from so many 
segments in a single step. 
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The procedure is as follows: 

This procedure allows the formation of true 3* overhangs using a single dU 
residue in each of the primers to be connected. It is based on the ability to 
chemically create a nick at the 3' end of an a-purinated dU residue. 

The fragments are produced by PCR. As in the other examples, the 
terminal portions of the fragments that are to be joined have to be 
complementary to each other. As described in example 5, this is achieved 
by producing the PCR fragments in the following way: The primers of a 
DNA fragment that is to be PCR produced and is to be connected to two 
other fragments are synthesized having two regions: 1. A 3' region that is 
complementary to the template. 2. A 5* region that is complementary to 
the terminal portion of the fragment that is to be joined to the said 
fragment. The length of the complementary regions should be 15 nt or 
more. In each primer, a dT residue is substituted by a dU residue at the 
junction between the complementary and non complementary regions. 

The various DNA fragments to be joined are mixed together and UDG, as 
well as N,N dimethylethylenediamine are added to the mixture. The 
enzyme a-purinates the dU base and the chemical generates a nick at the 
3* side of the a-purinated residue. 

Following the creation of the above mentioned nicks, the temperature is 
raised to 70°C in order to disconnect the short single-strand stretch 5* of 
the a-purinated dU residues from the rest of the fragment. Thus, 3' 
overhangs which constitute the complementary regions are created. After 
removing the nicked oligonucleotides, the reaction is cooled slowly to allow 
joining between the complementary overhangs while minimizing 
illegitimate connections. Following the cooling, the mixtures are used to 
transform bacterial cells. Further elaboration of the procedure is found in 
Example 5 below. 
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The examples to follow are provided for the purpose of illustration only, 
and it is clear that the invention is not limited to any particular method or 
procedure for creating the aforesaid overhangs. 

Example 1 

In Vitro Hybrid ization of Fragments with Complementary 
Overhangs of Different Lengths 

Overview 

In order to demonstrate the superiority of connection between DNA 
fragments having long overhangs, the degree of inter-connection of 
fragments with different lengths of overhangs was tested. Five pairs of 
fragments, each containing different length of the complementary region, 
were produced by PCR. Each pair consisted of: 1). A fragment of about 
1400 bp designated Tet, which was PCR amplified from the Tet r gene 
(conferring resistance to tetracyclin) of pBR 322. 2). A fragment of about 
1700 bp designated Amp, which was PCR amplified from the Amp r gene 
(conferring resistance to ampicillin) of pBR322. One of the two primers 
that was used to amplify the Tet fragment in each pair contained, in 
addition to a region which was complementary to the template DNA, a 5' 
tail with repeats of three bases CUA. Likewise, one of the primers used 
for Amp amplification contained a 5' tail with repeats of UAG. Note, that 
CUA and UAG are complementary to one another. In the first pair, the 
tail consisted of four repeats (12 bases) and the tails of the second, third, 
fourth and fifth pairs consisted of five, six, seven and ten repeats 
respectively (15, 18, 21 and 30 bases). 

The Amp fragment and the Tet fragment of each pair were mixed, and 
their 3' overhang exposed utilizing UDG. After an annealing time period 
the mixtures were subjected to agarose gel analysis (Fig. 7). 
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Primers and PCR cnndit.irmg 

For each of the 5 reactions, two fragments were produced. An Amp r 
fragment and a Tef fragment. The only variable between the five Amp' 
fragments (or between the five Tet fragments) was the length, but not the 
sequence, of the complementary region. 

The hybridization rate between pairs of DNA fragments was tested. 
A) The primers and production of the fragments 

In each of the primers below, and in the following examples, the * symbol 
is written below the dU residue in order to emphasize it. All of the primer 
sequences given herein are written in 5' to 3' direction. 

Primer SEQ. ID NO. 1 - Tet sense primer 

(also designated 31160) • upstream of Tet (PBR322: 1-22) 

-irrelevant tail- 22nt 
AGCTCCTGA-TTCTCATGTTTGACAGCTTATC 

Tet antisense primers 

note: the 5 primers below differ from one another only by the number of 
CUA repeats at their 5' end. 

Primer SEQ. ID NO. 2 

(also designated 3781) - downstream of Tet (pBR322: 1449-1425) 

25nt 

— 12nt (CUA)x4— TGG CCA GGA CCC AAC GCT GCC CGA G 

★ 

Primer SEQ. ID NO. 3 

(also designated 4082) - downstream of Tet (pBR322: 1449-1425) 

25nt 

-15nt (CUA)x5- TGG CCA GGA CCC AAC GCT GCC CGA G 
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Primer SEQ. ID NO. 4 

(also designated 4353) - downstream of Tet (pBR322: 1449-1425) 

25nt 

-18nt (CUA)x6— TGG CCA GGA CCC AAC GCT GCC CGA G 
Primer SEQ. ID NO. 5 

(also designated 4635) - downstream of Tet (pBR322: 1449-1425) 

25nt 

— 21nt (CUA)x7— TGG CCA GGA CCC AAC GCT GCC CGA G 

* 

Primer SEQ. ID NO. 6 

(also designated 5535) - downstream of Tet (pBR322: 1449-1425) 

25nt 

— 30nt (CUA)xlO— TGG CCA GGA CCC AAC GCT GCC CGA G 

* 

(Note that primers SEQ. ID NO. 2-6 are homologously complementary to 
primers SEQ. ID NO. 8-12 respectively at their 5' ends) 

Primer SEQ. ID NO. 7 - Amp sense primer 

(also designated 4142) - downstream of Amp (pBR322 2460-2479) 

irrelevant tail 20nt: 

ATTGGTGCCCTTAAACGCCTG-AACGCAGGAAAGAACATGTG 

Amp antisense primers 

note: the 5 primers below differ from one another only by the number of 
UAG repeats at their 5' end. 

Primer SEQ. ID NO. 8 

(also designated 36107) - upstream of Amp (pBR322: 4136-4159) 

24nt 

— 12nt (UAG)x4— AAG AGT ATG AGT ATT CAA CAT TTC 
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Primer SEQ. ID NO. 9 

(also designated 3993) - upstream of Amp (pBR322: 4136-4159) 

24nt 

— 15nt (UAG)x5— AAG AGT ATG AGT ATT CAA CAT TTC 
* 

Primer SEQ. ID NO. 10 

(also designated 4270) - upstream of Amp (pBR322: 4136-4159) 

24nt 

-18nt (UAG)x6— AAG AGT ATG AGT ATT CAA CAT TTC 
* 

Primer SEQ. ID NO. 11 

(also designated 4560) - upstream of Amp (pBR322: 4136-4159) 

24nt 

~21nt (UAG)x7— AAG AGT ATG AGT ATT CAA CAT TTC 
Primer SEQ. ID NO. 12 

(also designated 5434) - upstream of Amp (pBR322: 4136-4159) 

24nt 

— 30nt (UAG)xlO— AAG AGT ATG AGT ATT CAA CAT TTC 

★ 

Note that primers SEQ. ID NO. 8-12 are homologously complementary to 
primers SEQ. ID NO. 2-6 respectively at their 5' ends. 

The template for the PCR reaction was pBR 322 DNA. The full sequence 
and maps of the various regions of this plasmid is well known and can be 
accessed from GenBank database under accession no. J01749 (pBR322). 

The various fragments were produced by PCR, as follows: 
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PCR of Tet fragments 

(the number at the right denotes the length of the overhang) 

Tet 12 Tet 15 Tet 18 Tet 21 Tet 30 

10|il (0. 1 ng/nl) SEQ ID 2 SEQ ID 3 SEQ ID 4 SEQ ID 5 SEQ ID 6 

10nl (0. 1 ug/|il) SEQ ID 1 SEQ ID 1 SEQ ID 1 SEQ ID 1 SEQ ID 1 

pBR322 DNA 40 ng 40 ng 40 ng 40 ng 40 ng 

180ul PCR mix in each tube 

PCR of Amp fragments 

(the number at the right denotes the length of the overhang) 

Amp 12 Amp 15 Amp 18 Amp 21 Amp 30 

10nl (0. 1 jig/ul) SEQ ID 8 SEQ ID 9 SEQ ID 10 SEQ ID 11 SEQ ID 12 

10|al (0. 1 |ag/|il) SEQ ID 7 SEQ ID 7 SEQ ID 7 SEQ ID 7 SEQ ID 7 

pBR322 DNA 40 ng 40 ng 40 ng 40 ng 40 ng 

180^1 PCR mix in each tube 



PCR mix: 



dNTPs (2.5mM each) 128ul 

BuffxlO 160ul 

H.,0 1088ul 

Taq DNA Polymerase 5U/*il 6.4ul 

Temperature regime 
94°C 40sec 
40°C 2mm 
72°C 4mm 
30 cycles 
72°C 5min 
6°C infinitely 



All the inquired fragments were obtained. The concentration of each 
fragment was determined by measuring the density of the band created by 
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each DNA fragment on an agarose gel. The volumes used for the following 
experiment were corrected accordingly. 

B) Checking fusion ability 

Each pair of fragments having the same length of repeats was put in a 
different Eppendorf tube. The DNA was incubated at 37°C with UDG in 
order to create the overhangs and allow hybridization (see details below). 
The products of the reaction were subjected to electrophoresis on an 
agarose gel at 60V for 60min, stained with ethidium bromide and 
photographed under U.V. light (Fig. 7). Amp fragments are 1.7kb in 
length, Tet fragments are 1.4kb, and fragments of Amp+Tet that have 
hybridized to each other are 3.1 kb in length. The difference in the 
density between the Amp band, the Tet band and the Amp+Tet band, 
reflect the ability of the fragments to adhere to each other. Comparisons 
could thus be made between the adherence abilities of fragment-pairs 
having different overhang-lengths. 

The Amp and Tet fragments (of each pair) were mixed together as follows 
(The number of the mix is equivalent to the number of the complementary 
overhang). 



Mix 


12 


15 


18 


21 


30 


Amp frag. 


3ul 


1.5nl 


ini 


lui 


lfil 


Tet frag. 


3m1 


2^1 


2.5fil 


1.5^1 


5nl 


H 2 0 


2^1 


4.5|il 


4.5fil 


5.5nl 


2^1 


UDG Buff. xlO 


lMl 






lui 


1^1 



UDG Buff.xl0= 200mM Tris HC1 pH 8.4, 500mM KC1, 15mM MgCl 2 

The content of each of the tubes was divided into two. In the first tube, 
0.5|J. UDG was added. In the second, 0.5^1 H2O was added as a blank 
control. The tubes were incubated at 37°C for 5 hours. Then, the content 
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of the tubes was transferred into an 0.8% TAE Agarose gel. The gel was 
run at 60V for 60min, stained with ethidium bromide and photographed 
under U.V. light. Results are shown in Fig. 7. 

OVERHANG 12 15 18 21 30 

Tube No. 1 2 3 4 5 6 7 8 9 10 

UDG H 2 0 UDGH2O UDG H2O UDG H2O UDG H2O 

Lane on the gel: 12 34 5678 9 10 

The experiment shows: 1) Almost no fusion is observed when the 
overhangs are 12bp long. 2) In lane 7, there are more Tet fragments then 
Amp fragments. Therefore, even though there are almost no free Amp 
fragments left, some free Tet fragments still exist. 3) Both in lanes 7 
and 9, there are no free Amp fragments left. There is an outstanding 
ability of overhangs of 21 and 30bp to adhere to one another. 



In conclusion, the results of the experiment indicate that: 

1) Almost no fusion is observed when the overhangs are 12bp long. 

2) There is a dramatic increase in adherence starting with 
overhangs 15 bases long. 

3) There is an outstanding ability of almost 100% of overhangs of 
21 and 30 bases to adhere to one another. 

Example 2 

Multi DNA Fragment Assembly (MDFA) in-vitro: Comparison 
between fragments with overhangs of 12 and 20-21 bases 

I. Overview: 

The following experiment was carried out in order to show the stronger 
affinity and higher stability of connection between fragments with 
overhangs of 21 bases versus the connection of fragments having 
overhangs of 12 bases. Two sets of fragments were generated by PCR. In 
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the first set, the length of the complementary regions at each end of the 
fragments was 12 bp. In the second set, the length was 20-2 lbp. The 
complementary regions were repeats of 3 or 5 bases. Therefore, the 
sequence of the complementary regions in analogous fragments of the two 
sets can be regarded as identical, the only difference being the overall 
length of the repeated region. Four different fragments of each set were 
generated by PCR, mixed together and 3' complementary overhangs were 
exposed utilizing UDG. After an annealing time period, they were run on 
agarose gel in order to check the degree of the connection between the 
fragments (Fig. 11). 

II. Primers and PCR conditions: 

Four unique fragments of each set (one set with 12 bp complementary 
sequences and the other set with 21bp complementary sequences) were 
amplified by PCR. The fragments were designated TetA, TetB, Amp, and 
CmB. 

TetA was PCR amplified from pBR322 (bases 1-937). 
TetB was PCR amplified from pBR322 (bases 1067-1449). 
Amp was PCR amplified from pBR322 (bases 2460-4159). 
CmB was PCR amplified from pACYCl84 (bases 3768-4086). 

The fragments were designed to connect with each other in the following 
order: TetA-TetB-Amp-CmB. The primers were synthesized with 3' ends 
that would enable making PCR fragments from pBR322 and pACYCl84 
templates as in the former experiments. Two sets of 8 primers were 
synthesized. In each set, three junctions with complementary repeats were 
designed. A junction between the TetA and the TetB fragments, a junction 
between the TetB and the Amp fragments and a junction between the Amp 
and the CmB fragments. Each of the repeats in a set was different and was 
complementary-homologous only to one of the other repeats. The PCR 
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fragments that were produced could therefore be connected to one another 
via the repeats, to form a linear fusion product. 

Primers of the first set: primers containing repeats or near-repeats of 12 
nucleotides with dUs at their 3' ends 

Primer SEQ ID NO. 13 - Tet A sense 

(also designated 37101) upstream of Tet (pBR322: 1-25) 

12nt 25nt 

AU CAUAU CAUAU TTC TCA TGT TTG ACA GCT TAT CAT C 



* * * * 



Primer SEQ ID NO. 15 - Tet A antisense 

(also designated 37103) 

12nt 25nt (pBR322: 913-937) 

UA CUAUA CUAUA CAT GCC GGC GAT AAT GGC CTG CTT C 

***** 

Primer SEQ ID NO. 16 - Tet B sense 

(also designated 37104) 

12nt 25nt (pBR322: 1067-1091) 

UAUAG UAUAG UA GTA GAT GAC GAC CAT CAG GGA CAG C 

* * * * * 



Primer SEQ ID NO. 2 Tet B antisense 
downstream of Tet (pBR322: 1449-1425) 

- — 12nt CUAx4- — 25nt 

CUA CUA CUA CUA TGG CCA GGA CCC AAC GCT GCC CGA G 
* * * * 



Primer SEQ ID NO. 8 - Amp sense 

upstream of Amp (pBR 322: 4159-4136) 

12ntUAGx4- 24 nt 

UAG UAG UAG UAG AAG AGT ATG AGT ATT CAA CAT TTC 

* * * * 
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Primer SEQ ID NO. 18 - antisense Amp 
(also designated 36108) 

downstream of ORI 

12 nt AUGx4 - pBR322 (2460-2483) 

AUG AUG AUG AUG AAC GCA GGA AAG AAC ATG TGA GCA 

* * * ★ 



Primer SEQ ID NO. 17 - Cm B sense 
(also designated 37106) 

12nt 25nt (pACYCl84: 4086-4062) 

AU CUAAU CUAAU TAC GGT GAA AAC CTG GCC TAT TTC C 
* * * * ★ 



Primer SEQ ID NO. 14 - Cm B antisense 

(also designated 3866) 

12ntCAUx4 26nt (pACYC184: 3768-3793) 

CAU CAU CAU CAU CAG GCG TTT AAG GGC ACC AAT AAC TG 

* * * * 



PCR REACTIONS 

(The number on the right denotes the length of the overhang) 
reaction TetA12 TetB12 Ampl2 Cm B12 

20ml primer 1 SEQ ID 13 SEQ ID 16 SEQ ID 18 SEQ ID 14 
20ml primer 2 SEQ ID 15 SEQ ID 2 SEQ ID 8 SEQ ID 17 
(Primer concentration: 0.1 ng/^l) 
expected 

fragment size: 961bp 406bp 1724bp 340bp 
mix 

2^1 dNTP's (2.5 mM each) 112^1 
2.5|il lOxBuff. I40|il 
18*il H20 1008^1 
0.1^1 Taq(5u/ul) 5.6^1 
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put 360nl mix in each tube, add primers 

add lul DNA pBR322 P BR322 pACYCl84 pBR322 

(DNA concentration: 40 ng/fil) 

Temperature refr imp 
94°C 40sec 
40°C 2min 
72°C 4min 
30 cycles 
72°C 5min 
6°C infinitely 

Primers of the second set: primers containing repeats of 21 nucleotides 

with dUs at their 3' ends 

Primer SEQ. ID NO. 19 - TetA sense 

(also designated 4554) - upstream of Tet (pBR322: 1-25) 

20nt CAUAUx4 25nt 

CAUAU CAUAU CAUAU CAUAU TTC TCA TGT TTG ACA GCT TAT CAT C 

* * * * * * * * 

Primer SEQ. ID NO. 21 - TetA antisense 

(also designated 4558) - (pBR322: 913-937) 

20nt CUAUAx4 25nt 

CUAUA CUAUA CUAUA CUAUA CAT GCC GGC GAT AAT GGC CTG CTT C 

* * * * * * ★ * 

Primer SEQ. ID NO. 22 - TetB sense 

(also designated 4559) - (pBR322: 1067-1091) 
20nt UAUAGx4 25nt 

UAUAG UAUAG UAUAG UAUAG GTA GAT GAC GAC CAT CAG GGA CAG C 

** * * ★ * * * 



Primer SEQ. ID NO. 5 - TetB antisense 

downstream of Tet (pBR322: 1449-1425) 

21nt CUAxT 25nt 

CUA CUA CUA CUA CUA CUA CUA TGG CCA GGA CCC AAC GCT GCC CGA G 
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Primer SEQ. ID NO. 11 - Amp sense 

-pBR322 (4159-4136) 
- 21nt UAGxT 2 4nt 

UAG UAG UAG UAG UAG UAG UAG AAG AGT ATG AGT ATT CAA CAT TTC 

******* 



Primer SEQ. ID NO. 24 Amp antisense 

(also designated 4561) . p BR322 coding (2460-2483) 
21nt AUGx7 24 nt 

AUG AUG AUG AUG AUG AUG AUG AAC GCA GGA AAG AAC ATG TGA GCA 
* * * * * * • 



Primer SEQ. ID NO. 23 - Cm B sense 

(also designated 4556) - (pACYCl84: 4086-4062) 

20nt CUAAUx4 25nt 

CUAAU CUAAU CUAAU CUAAU TAC GGT GAA AAC CTG GCC TAT TTC C 



Primer SEQ. ID NO. 20 - CmB antisense 

(also designated 4743) - (pACYCl84: 3768-3793) 
21ntCAUx7 26nt 

CAU CAU CAU CAU CAU CAU CAU CAG GCG TTT AAG GGC ACC AAT AAC TG 
* * * * * * * 



PCR REACTIONS 



Amp21 
Seq. ID 11 
Seq. ID 24 



1742bp 



reaction TetA21 Tet B21 

20ul primer 1 Seq. ID 19 Seq. ID 22 
20ul primer 2 Seq. ID 21 Seq. ID 5 
(primer concentration: 0.1 ug/ul) 
expected 

fragment size: 977bp 423bp 

mix x i28 
2ul dNTP's (2.5 mM each) 256ul 
2.5ul lOxBuff. 320^ 
18ul H20 2304ul 
O.lul Taq 5UV1 12.8^1 
(dNTP concentration: 2.5 mM each) 

put 360ml mix in each tube, add primers 

add lul DNA P BR322 p BR322 P ACYC184 

(DNA concentration: 40 ng/nl) 



Cm B21 
Seq. ID 20 
Seq. ID 23 



357bp 



pBR322 
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Temperature rep pmp 
94°C 40sec 
40°C 2min 
72°C 4min 
30 cycles 
72°C 5min 
6°C infinitely 

III. Fusion of the fragments 

After the amplification, the fragments were mixed and 3* complementary 
overhangs were exposed utilizing UDG. Fusions of either 2 fragments, 3 
fragments or 4 fragments of the two sets (i.e., of fragments that were 
created with primers of the two above-described sets) were performed. 
After the fusion the reactions were run on a 0.8% agarose gel in Ix TAE 
stained with ethidium bromide and photographed under U.V. light (Fig. 
11). 

In order to demonstrate the size of the various fragments, they were run in 
separate lanes (TetA, TetB, Amp, and CmB, in lanes 1, 4, 7, and 10 
respectively). Only fragments of the first set were run, since those of the 
second set have identical lengths. The various mixtures that included the 
fused fragments were also run. 
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Reactions and lanes no. on the gel: 



5. 



1.8ul 
Tet A12 

1.8^1 
Tet A21 

1.8pl 
Tet A12 

1.8nl 
Tet B12 

1.8^1 
TetA21 



1.8^1 
Tet B21 

1.8nl 
Tet B12 



1.8^1 
TetB21 



13.6^1 2^1 0.8^1 

H2O UDG BuffxlO UDG 

13.6nl 2^1 0.8m1 

H2O UDG BuffxlO UDG 



1-8^1 ll.8\xl 2ul 0.8^1 

Amp21 H2O UDG BuffxlO UDG 



1.8ul 
TetA12 

1.8ul 
Amp 12 



1.8^1 1.8^1 11.8^1 2^il 0.8m1 

TetB12 Ampl2 H2O UDG BuffxlO UDG 



8. 



1.8nl 1.8^1 1.8^1 1.8^1 10^ 2^ 0.8^1 

TetA21 TetB21 Amp21 CmB21 H2O UDG BuffxlO UDG 



1.8ul 1.8^1 1.8ul 1.8^1 10|il 2^1 0.8nl 

TetA12 TetBl2 Ampl2 CmB12 H 2 0 UDG BuffxlO UDG 



10. 



1.8^1 
CmBl2 



11. Molecular Weight Markers 

the concentration of all the DNA fragments was 0.09 pmol/|il. 
UDG Buff.xlO= 200mM Tris HC1 pH 8.4, 500mM KC1, 15mM MgCl 2 



The reactions were incubated at 37°C for 5 hours and then run on a gel. 
The results are shown in Fig. 11. The order of the lanes is shown at the 
top, and the sizes of the molecular weight markers are indicated at the 
right side of the photograph. 
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From the above results, the following conclusions can be drawn: 

1. Detectable fusion products are observed when the overhangs are 21 
bp long, but not when they are 12bp long. 

2. Since the fusion efficiency of 21 bp overhangs is high but less then 
100% 3 products of partial fusions are also observed. In lanes 2, 5 and 8, the 
band which appears highest on the gel, has the length of the required end 
product. 

3. The results demonstrate that the amount of the required product 
decreases when the number of fragments required to generate it, increases. 

This example clearly demonstrated the advantage of using fragments with 
long overhangs. No end products were observed when using fragments 
with 12 nt overhangs. When using fragments with 20-21 nt overhangs, the 
fusion of 2-3 and 4 fragments is clearly demonstrated. 

Example 3 

Multi DNA Fragment Assembly (MDFA) in-vivo: Comparison 
between fragments with overhangs of 12 and 20-21 bases 

The following experiment illustrates the advantage of connecting 
fragments having 20-21 base overhangs as compared to fragments having 
12 base overhangs, in vivo. Fragments having either 12 base overhangs or 
20-21 base overhangs were produced by PCR and connected, forming 
circular plasmids. These plasmids were transformed into bacterial cells 
and the colonies were examined. 

Two sets of plasmids were constructed, as depicted in Figs. 1-6. In the first 
set, fragments containing 12 base overhangs were used (Figs. 1-3). In the 
second, fragments containing 20-21 base overhangs were used (Figs. 4-6). 
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Each set contained three plasmids: 

1. A plasmid constructed out of the three basic fragments. 

an Amp 1 fragment (hereinafter Amp) which contained the Ampicillin 
resistance gene. 

- a Tet r fragment (hereinafter Tet) which contained the Tetracycline 
resistance gene. 

a Cm 1 ' fragment (hereinafter Cm) which contained the 
Chloramphenicol resistance gene. 

2. A plasmid constructed out of four fragments; the said Amp and Cm 
fragments and two fragments designated TetA and TetB, which are two 
separate regions of the Tet gene. 

3. A plasmid constructed out of four fragments; the said Amp and Tet 
fragments and two fragments designated CmA and CmB, which are two 
separate regions of the Cm gene. 

The plasmids of the first set are equivalent to the plasmids of the second 
set except for the length of the overhangs produced. The overhangs of the 
fragments were designed and produced by adding short repeats of either 3 
or 5 bases. Each of the repeats in a set, was different and was 
complementary to one of the other repeats. The PCR fragments that were 
produced could therefore be connected to one another via the repeats to 
form a full circular plasmid (see Figs. 1-6). Analogous primers of the two 
sets were nearly identical, the only difference being in the length of the 
repeat sequence. In the first set, the overall lengths of the repeats was 12 
bp. In the second, the overall length of the repeats was 20-21 bp. In all 
cases, the dT residues in the repeats have been changed to dU residues. 
The 5' termini of fragments containing the repeats could therefore be 
dissociated from the complementary strand by a-purinating the dU 
residues ; exposing a 3' overhang of the required size. 
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After amplifying the fragments comprising each plasmid they were mixed 
and their termini exposed utilizing UDG. After an annealing time period 
each mixture was used to transform bacterial cells, as detailed below. 

A) Primers and PCR conditions 

Primers were synthesized with 3' ends that would enable generating PCR 
fragments from pBR322 and pACYCl84 templates. The full sequence and 
maps of the various regions of these plasmids are well known and can be 
accessed from GenBank database under accession Nos. J01749 (pBR322), 
and X06403 (pACYCl84). 
Two sets of primers were synthesized: 

First set of pri mers: primers containing repeats or near-repeats of 12 
nucleotides with dUs at their 3' ends 

Seq. ID NO. 13 - Tet A sense 

upstream of Tet (pBR322: 1-25) 

12nt 25nt 

AU CAUAU CAUAU TTC TCA TGT TTG ACA GCT TAT CAT C 



* * * * 



Seq. ID NO. 15 - Tet A antisense 



downstream of Tet (pBR322: 913-937) 

12nt 25nt 

UA CUAUA CUAUA CAT GCC GGC GAT AAT GGC CTG CTT C 



* * * * 



Seq. ID NO. 16 - Tet B sense 



(pBR322: 1067-1091) 

-12nt 25nt 

UAUAG UAUAG UA GTA GAT GAC GAC CAT CAG GGA CAG C 



* * * * 



Seq. ID NO. 2 - Tet B antisense 



downstream of Tet (pBR322: 1449-1425) 

12nt CUAx4- — 25nt 

CUA CUA CUA CUA TGG CCA GGA CCC AAC GCT GCC CGA G 
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Seq. ID NO. 18 - Amp sense 

- upstream of Amp (pBR322 2460-2483) 

12 bp AUGx4 ---- 24nt 

AUG AUG AUG AUG AAC GCA GGA AAG AAC ATG TGA GCA 
* * * * 



Seq, ID NO. 8 - Amp antisense 

- downstream of Amp (pBR322: 4159-4136) 

12 bp UAGx4 24 nt 

UAG UAG UAG UAG AAG AGT ATG AGT ATT CAA CAT TTC 
* * * * 

Seq. ID NO. 17 - Cm B sense 

- (pACYCl84: 4086-4062) 

12bp 25nt 

AU CUAAU CUAAU TAC GGT GAA AAC CTG GCC TAT TTC C 

* * * * * 

Seq. ID NO. 14 - Cm B antisense 

downstream of Cm (pACYCl84: 3768-3793) 
12nt CAUx4- — 26nt 

CAU CAU CAU CAU CAG GCG TTT AAG GGC ACC AAT AAC TG 

* * * * 



Seq. ID NO. 25 - Cm A sense 

(also designated 37102) - upstream of Cm (pACYCl84: 240-216) 
12nt 25nt 

AUAUG AUAUG AU TCA GGA GCT AAG GAA GCT AAA ATG G 



Seq. ID NO. 26 - Cm A antisense 

(also designated 37105) - (pACYCl84: 22-46) 

12bnt 25nt 

AUUAG AUUAG AU CAG GCG GGC AAG AAT GTG AAT AAA G 
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PCR reactions 

reaction TetA12 Tet B12 Cm B12 CmA12 

20^1 primer 1 SEQ. ID 13 SEQ. ID 16 SEQ. ID 14 SEQ. ID 25 
20^1 primer 2 SEQ. ID 15 SEQ. ID 2 SEQ. ID 17 SEQ. ID26 
(primer concentration: 0.1 jag/|il) 
expected 

fragment size: 961bp 406bp 340bp 241bp 



reaction Tetl2 Cm 12 Amp 12 

20^1 primer 1 SEQ. ID 13 SEQ. ID 14 SEQ. ID 18 

20ul primer 2 SEQ. ID 2 SEQ. ID 25 SEQ. ID 8 
(primer concentration: 0.1 ng/ul) 
expected 

fragment size 1473bp 739bp 1724bp 



mix xl28 

2ul dNTP's (2.5 mM each) 256ul 
2.5nl lOxBuff. 320ul 

18ul H20 2304ul 
0.1(il Taq (5U/ul) 12.8ul 

put 360ul mix in each tube, add primers 



add l\d DNA (40ng/|al): 

Tet A12 Tet B12 
pBR322 pBR322 



Cm B12 Cm A12 
PACYC184 P ACYC84 



Tet 12 Cm 12 Amp 12 

pBR322 pACYCl84 pBR322 
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Temperature regime 

94 °C 40sec 

40 °C 2min 

72 C C 4min 

30 cycles 

72 °C 5min 

6° C infinitely 



Second se t of primers: primers containing repeats of 21 nucleotides with 

dUs at their 3' ends 



SEQ. ID NO. 19 - TetA sense 

-upstream of Tet (pBR322: 1-25) 

20nt CAUAUx4 25nt 

CAUAU CAUAU CAUAU CAUAU TTC TCA TGT TTG ACA GCT TAT CAT C 
** ** ** * * 



SEQ. ID NO. 5 TetB antisense 

- downstream of Tet (pBR322: 1449-1425) 

- — 21nt CUAxT 25nt 

CUA CUA CUA CUA CUA CUA CUA TGG CCA GGA CCC AAC GCT GCC CGA G 

* * * * * * * 



SEQ. ID NO. 21 - TetA antisense 

(pBR322: 913-937) 

20nt CUAUAx4 25nt 

CUAUA CUAUA CUAUA CUAUA CAT GCC GGC GAT AAT GGC CTG CTT C 

* * * * * * * * 



SEQ. ID NO. 22 - TetB sense 

-(pBR322: 1067-1091) 

20nt UAUAGx4 25nt 

UAUAG UAUAG UAUAG UAUAG GTA GAT GAC GAC CAT CAG GGA CAG C 

******** 



SEQ. ID NO. 20 - CmB antisense 

(pACYC184: 3768-3793) 

21nt CAUx7 26nt 

CAU CAU CAU CAU CAU CAU CAU CAG GCG TTT AAG GGC ACC AAT AAC TG 
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SEQ. ID NO. 27 - CmA sense 



(also designated 4555) - (pACYCl84: 240-216) 

20nt AUAUGx4 25nt 

AUAUG AUAUG AUAUG AUAUG TCA GGA GCT AAG GAA GCT AAA ATG G 

* * * * * * * * 

SEQ. ID NO. 28 - Cm A antisense 

(also designated 4557) - (pACYCl84: 22-46) 

20ntAUUAGx4 25nt 

AUUAG AUUAG AUUAG AUUAG CAG GCG GGC AAG AAT GTG AAT AAA G 

** * ★ * * * * 

SEQ. ID NO. 23 - Cm B sense 

(pACYCl84: 4086-4062) 

— 20nt CUAAUx4 25nt 

CUAAU CUAAU CUAAU CUAAU TAC GGT GAA AAC CTG GCC TAT TTC C 

******** 



SEQ. ID NO. 11 - Amp antisense 

pBR322 (4159-4136) 

21nt UAGxT 24 nt 

UAG UAG UAG UAG UAG UAG UAG AAG AGT ATG AGT ATT CAA CAT TTC 
******* 



SEQ. ID NO. - 24 Amp sense 

pBR322 (2460-2483) 

21nt AUGx? 24 nt 

AUG AUG AUG AUG AUG AUG AUG AAC GCA GGA AAG AAC ATG TGA GCA 
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reaction TetA21 TetB21 CmB21 CmA21 

20ul primer 1 SEQ. ID 19 SEQ. ID 22 SEQ. ID 20 SEQ. ID 28 

20^1 primer 2 SEQ. ID 21 SEQ. ID 5 SEQ. ID 23 SEQ. ID 27 
(primer concentration: 0.1 ng/ul) 
expected 

fragment size: 977bp 423bp 357bp 257bp 



reaction 
20ul primer 1 
20(j.l primer 2 



Tet21 
SEQ. ID 19 
SEQ. ID 5 



(primer concentration: 0.1 ug/pl) 
expected 

fragment size: 1489bp 



Cm21 Amp21 
SEQ. ID 20 SEQ. ID 11 
SEQ. ID 27 SEQ. ID 24 



756bp 



1742bp 



mix X 128 

2nl dNTP's (2.5 mM each) 256ul 

2.5ul lOxBuff. 320ul 

18^1 H20 2304^1 

0.1^1 Taq (5 u/nl) 12.8ul 

put 360ul mix in each tube, add primers 
add lul DNA (40 ng/p.1) 



Tet A21 
pBR322 

Tet 21 
pBR322 



Tet B21 
pBR322 

Cm21 
PACYC184 



Cm B21 
PACYC184 

Amp21 
pBR322 



Cm A21 
pACYC84 
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Temperature regime 
94 °C 40sec 
40°C 2min 
72°C4min 
30 cycles 
72 °C 5min 
6° C infinitely 

The required plasmids are illustrated in Figs. 1 through 6. Figures 1-3 
depict plasmids constructed from fragments containing 12 nucleotide (nt) 
overhangs. Figures 4-6 depicts plasmids constructed from fragments 
containing 21 nt overhangs. 



B. Connecting the fragments 

The PCR fragments were mixed and their 3' overhangs exposed by 
utilizing UDG. The various reactions are given below. Two identical 
independent experiments were conducted. The DNA fragments for each 
experiment were prepared independently as well. 



The concentration of the DNA fragments was 0.05 pmol/fil. 

UDG Buff. xlO = 200mM Tris HC1 pH 8.4, 500mM KC1, 15mM MgCl 2 . 



Reactions mixtures: 



lul 

1. Amp 12 


lul 
Tetl2 


Cm 12 


mi 

H2O 


7ul 
H2O 


1.2ul 
UDG Buff.xl0 


0.8ul 
UDG 


ml 

2. Amp 12 


lfd 
Tetl2 


lul 
CmA12 


inl 

CmBl2 


7ul 
H2O 


1.2ul 
UDG Buff.xl0 


0.8ul 
UDG 


mi 

3. Amp 12 


mi 

TetA12 


mi 

TetBl2 


mi 

Cml2 


7ul 
H2O 


1.2ul 
UDG BuffxlO 


0.8ul 
UDG 


mi 

4. Amp 21 


mi 

Tet21 


mi 

Cm21 


mi 

H 2 0 


7(il 
H2O 


1.2ul 
UDG Buff.xlO 


0.8ul 
UDG 


mi 

5. Amp 21 


mi 

Tet21 


mi 

CmA21 


mi 

CmB21 


7|xl 
H2O 


1.2ul 
UDG Buff.xlO 


0.8ul 
UDG 


mi 

6. Amp 21 


mi 

TetA21 


mi 

TetB21 


mi 

Cm21 


7ul 
H2O 


1.2ul 
UDG Buff.xlO 


0.8ul 
UDG 
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The reactions were incubated at 37°C for 5 hours. 1 ^1 of each reaction was 
used in the electroporation of 20^1 GibcoBRL ElectroMax DH10B cells. 
After the transformation the cells of reactions 1 and 4 were plated on LB 
plates containing Ampicillin, Tetracycline and Chloramphenicol, for 
selection (Figs. 8A and 8B respectively). Likewise, the cells of reactions 2 
and 5 were plated on LB plates containing Ampicillin and Tetracycline 
(Figs. 8C and 8D respectively). Cells of reactions 3 and 6 were plated on 
LB plates containing Ampicillin and Chloramphenicol (Figs. 8E and 8F 
respectively). The fusion fragments CmA-CmB and TetA-TetB contain only 
the upstream and downstream fragments of Cm r and Tet r . In addition they 
are interconnected by short linker repeats. They cannot therefore confer 
resistance to chloramphenicol or tetracycline. 

An illustration of the constructs along with the number of transformants 
obtained, is given in Fig. 9. In the figure, colonies marked with a * were 
checked by PCR as detailed below and proven to contain plasmids with 
rearrangements rather than the required constructs. 

Further examination of the constructs by PCR 

The 6 transformants of reactions 2 as well as 16 representatives from 
reaction 5 were PCR tested in order to check whether they contain the 
desired constructs: Since these transformants were resistant to both 
Ampicillin and Tetracycline it was assumed that these fragments had 
fused correctly to each other. It was desired to know whether the Cm A and 
Cm B were present as well. Therefore the existence of the Cm fragments 
was tested by checking the various transformants by PCR with primers 
SEQ. ID NO. 14 and SEQ. ID NO. 25. Since in this constructs, Cm A 
(which encompasses part of the 5' region of Cm) is 257bp long and Cm B 
(which encompasses part of the 3' region of Cm) is 357bp long, the overall 
size of the expected PCR product should be 593 bp long, some 142 bp 
shorter than the size of a complete Cm fragment (756 bp long). 
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In Fig. 10(a), the six left lanes are of the transformants obtained from 
reaction 2, the next five lanes are from reaction 5. The right lane is a full 
Cm-fragment control. 



In Fig. 10 (b), the right lane is a full Cm-fragment control. The rest of the 
lanes are of more transformants from reaction 5. 

The results indicate that none of the transformants in reaction 2 contain 
the desired constructs. In reaction 5, 9 out of 16 transformants contain the 
desired product. 



Transformants from reactions 4, 5 and 6 were further tested by PCR to 

check if they contain the desired fragments in the desired order, as 
detailed below: 

reaction: A BCD 

Tet CmATetA Cm B Amp Amp Tet B 

primer 1 SEQ. ID 13 SEQ. ID 26 SEQ. ID 17 SEQ. ID 16 

primer 2 SEQ. ID 2 SEQ. ID 15 SEQ. ID 8 SEQ. ID 18 
expected 

fragment size: 1473bp 1180bp 2043bp 1207bp 

The tests carried out according to the above, verified that the 
transformants from reactions 5 and 6 (which were plasmids constructed 
from fragments having 21 nt overhangs) contained the desired fragments 
in the correct order. In reaction 5, 9 out of 16 transformants contained the 
desired product. In reaction 6, 2 out of 2 transformants contained the 
desh^ed product. 



This experiment clearly demonstrates the advantage of constructing 
plasmids from fragments with long overhangs (around 21 nt). Almost no 
colonies were obtained in the 3 experiments (1-3) that used fragments with 
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overhangs of 12 nt. The only 6 colonies that were observed did not contain 
the correct construct. In contrast, in the experiments that used fragments 
with overhangs of 21 nt, colonies containing the correct product were 
readily obtained. 

Example 4 

Assembly of Th ree DNA Fragments Using ExoIII-created 

Overhangs 

Exonuclease III (ExoIII) of E. coli was used in order to create 
complementary overhangs of 20 bases or longer. ExoIII digests one strand 
of blunt-ended DNA or DNA containing 5' overhangs, in a 3' to 5' direction, 
thus creating a 5* overhang or enlarging an existing 5' overhang. By 
regulating the temperature and time of the reaction one can control the 
extent of the digestion, hence the length of the overhang. 

The formation of a construct made of three fragments is illustrated. Using 
the same method, plasmids made out of four and five segments were also 
constructed (not shown). 

The plasmid illustrated in Fig. 12 was assembled by the joining of three 
independently produced DNA fragments. These three DNA fragments are: 

a) a DNA fragment of 1739 base pairs (bp) containing the Amp r 
gene and the ColEl-ori region; 

b) a DNA fragment of 1466 bp containing the Tet r gene; and 

c) a DNA fragment of 745 bp containing the Cm 1 gene. 

These fragments are illustrated schematically in Fig. 12 showing their 
relative positioning one to the other, namely, that the Cm r fragment was 
to be connected at its one end to one end of the Tet r fragment and at its 
other end to one end of the Amp r + ColEl-ori fragment, and likewise, the 
other ends of the Tet 1 * and Amp r + ColEl-ori fragments were to be 
connected to each other to provide for a circular DNA molecule, being the 
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desired plasmid, having the above predetermined order of the three 
fragments. 

To produce the above three DNA fragments, the aforementioned PCR 
procedure was carried out using the following primers and template DNA: 

a) The 1739 bp Amp r + ColEl-ori DNA fragment was synthesized by 
the PCR procedure using primers SEQ. ID NO. 29 (also designated 4142) 
and SEQ. ID NO. 30 (also designated 3884) and pBR322 as the template 
DNA. The concentrations of the primers and template DNA, as well as the 
other PCR conditions, are as indicated above. The relative direction of the 
primers with respect to the synthesis of the Amp r + ColEl-ori fragment as 
it is positioned in the completed plasmid product is as depicted 
schematically in Fig. 12. Thus, primer SEQ. ID NO. 29 was synthesized 
to have a predetermined sequence so as to provide for the desired junction 
region between the Amp r + ColEl-ori fragment and the Cm r fragment, and 
primer SEQ. ID NO. 30 was synthesized to have a predetermined 
sequence so as to provide for the desired junction between the Ampr + 
ColEl-ORI fragment and the Tet r fragment. 

b) The 1466 bp Tet r DNA fragment was synthesized by the PCR 
procedure using primers SEQ. ID NO. 31 (also designated 31160) and 
SEQ. ID NO. 32 (also designated 30397) and pBR322 as the template 
DNA. The concentrations of the primers and template DNA, as well as 
the other PCR conditions, are as indicated above. The relative direction of 
the primers with respect to the synthesis of the Tet r fragment as it is 
positioned in the completed plasmid product is as depicted schematically 
in Fig. 12. 

c) The 745 bp Cm r DNA fragment was synthesized by the PCR 
procedure using primers SEQ. ID NO. 33 (also designated 3595) and SEQ. 
ID NO. 34 (also designated 4143) and pACYCl84 as the template DNA. 
The relative direction of the primers with respect to the synthesis of the 
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Cm r fragment as it is positioned in the completed plasmid product is as 
depicted schematically in Fig. 12. 

The sequences of the above primers are as follows: 

The arrows inside the seuqences mark the junction points. 

primers SEQ. ID NO. 29 and SEQ. ID NO. 30 
for the Amp r + ColEl-ori segment: 

part of Cm r region sequence part of Amp' region sequence 
primer SEQ. ID NO. 29: ATTGGTGCCCTTAAACGCCTGlAACGCAGGAAAGAACATGTG 
(also designated 4142) 

part of Tet r region sequence part of Amp r region sequence 

primer SEQ. ID NO. 30: AGCGTTGGGTCCTGGiCCAAAGAGTATGAGTATTCAACA 
(also designated 3884) 

primers SEQ. ID NO. 31 and SEQ. ID NO. 32 
for the Tet 1 ' segment: 

part of 

Cm 1 region sequence part of Tet r region sequence 

primer SEQ. ID NO. 31: AGCTCCTGAlTTCTCATGTTTGACAGCTTATC 
(also designated 31160) 

part of 
Amp r + ColEl-ori 
region sequence part of Tet r region sequence 

primer SEQ. ID NO. 32: ATACTCTTiTGGCCAGGACCCAACGCTGCCC 
(also designated 30397) 

primers SEQ. ID NO. 33 and SEQ. ID NO. 34 

for the Cm r segment: 

part of 
Tet r region 

sequence part of Cm r region sequence 

primer SEQ. ID NO. 33: AAACATGAGAAiTCAGGAGCTAAGGAAGCTAAAATG 
(also designated 3595) 



The Amp region The CM region 

primer SEQ. ID NO. 34: ATGTTCTTTCCTGCGTTiCAGGCGTTTAAGGGCACCAATAAC 
(also designated 4143) 



WO 98/38298 



PCT/1L98/00096 



-42- 

In view of the fact that the original primer sequences were derived only 
from the Tet 1 ', Cm r and Amp r + ColEl-ori region genes, with no 
introduction of any additional ("linker") DNA sequences whatsoever, the 
above junctions between said genes are "seamless". 

The PCR was carried out according to the description in the former 
examples. Each PCR fragment, separately, was then subjected to agarose- 
gel purification using a commercial kit, namely, Bio-Rad's Prep-A-Gene™ 
purification kit and following the manufacturer's instructions. 

Once purified, each PCR fragment was then quantitated by determining 
the DNA concentration of each fragment by standard procedures, using 
Pharmacia's Gene-Quant™ RNA/DNA calculator and following the 
manufacturer's instructions. 

The PCR fragments were then subjected to Exo III digestion and 
subsequent joining procedure. The various PCR fragments were mixed 
together (0.15 pmol DNA for each fragment), in a cooled (6°C) reaction 
mixture of 12 jil containing: 1.2 \il 10 x TA buffer (330 mM Tris-acetate, 
pH 7.8; 660 mM potassium acetate, 100 mM magnesium acetate and 5 
mM DTT); 0.8 \d Exo III (200 U/jil purchased from Epicentre 
Technologies); and sterile double distilled H2O to make up the final 
volume of 12 (al In practice, the PCR fragments were mixed into a 
precooled, namely, 6°C, TA buffer solution made up to 11.2 |il with the 
H2O, to which was then added the 0.8 [d Exo III. Adding the Exo III last 
provides for better control over the Exo III reaction, which is controlled by 
the time of incubation of the PCR fragments with the Exo III. Following 
the Exo III addition, the reaction mixture was then incubated at 6°C for 
40 mins. (the time necessary to achieve more than 20 nucleotide 
degradation of each DNA strand in the 3'-5' direction under the above 
conditions of temperature = 6°C and concentration of Exo III). 
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The Exo III reaction was then stopped by performing a phenol extraction. 
This is done by adding to the above reaction mixture an equal volume = 12 
|il of a 1:1 (v/v) phenol/chloroform mixture which causes denaturation of 
the Exo III The aqueous phase was then separated from the above 
phenol/chloroform mixture, this aqueous phase contains the PCR 
fragments. The separated aqueous phase was then subjected to three 
washes with chloroform to yield a final aqueous, phase having an 
essentially purified mixture of the Exo Ill-digested PCR fragments. 

In order to prevent both the evaporation of the buffer and the drying of the 
fragments, 40f.il of mineral oil was added to the vessel containing the Exo 
Ill-digested PCR fragments. The mixture was then heated to 75°C, at 
which temperature it was further incubated for one hour. After this 
incubation, the mixture is slowly cooled, under conditions providing only a 
2°C decrease in temperature per hour, until it reached 37°C (the heating 
and cooling to provide for specific complementary interactions between 
complementary overhangs on the PCR fragments and to prevent non- 
specific interactions). This heating and cooling represents the first stage 
of the specific joining between the fragments, the joining by way of 
hydrogen bonding between complementary overhangs. 

Once the above mixture has reached 37°C, it was then subjected to the 
final stage of the joining, including the filling in reaction, as follows: 

To the cooled (at 37°C) mixture of now essentially joined (by hydrogen 
bonding of complementary overhangs) PCR fragments there was added 10 
[il of the 'Synthesis Mixture', which contains: 
ljil of 20 mM ATP 

4 yd of 2.5 mM dNTPs (= dATP, dTTP, dCTP and dGTP in equal 
amounts, concentration of each = 2.5 mM) 

1 liI of 10 x TA buffer (see above for constituents) 
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1 pi of T7 DNA polymerase (5 U/^l, purchased from USB) 

1 ul of T4 DNA ligase (10 U/^il, purchased from Epicentre 
Technologies) 

2 ^1 of sterile double -distilled H2O 
Total volume: 10 ul 

The above synthesis mixture had the T7 DNA polymerase and dNTPs to 
facilitate the filling in of gaps in the junction regions between the joined 
fragments, as well as the ATP and the T4 ligase to covalently join the 
DNA strands together once the filling in of gaps has been completed. 

Following the addition of the synthesis mixture, the resulting reaction 
mixture was then incubated for two hours at 37°C. After this incubation, 
the reaction mixture was then ethanol precipitated under standard 
conditions to finally yield a pellet of precipitated DNA which was 
essentially the completed DNA construct composed of the joined PCR 
fragments. This DNA pellet was resuspended in 5 yd sterile double- 
distilled H2O and is ready for further analysis or use. 

Following the preparation of the desired construct described above from 
the three PCR fragments, this construct was analyzed for its biological 
activity, namely, whether or not it could confer resistance to all three 
antibiotics when introduced into bacterial cells. Thus, electroporation of 
electrocompetent DH10B E. coli cells was performed using a 2 yil aliquot of 
a 5 jal final product containing the DNA construct. After electroporation, 
the cells were first plated on agar plates containing ampicillin. The 
results revealed more than 1000 colonies on these plates, indicating that 
more than 1000 originally transformed cells received a DNA construct 
having at least an active Amp r gene. Of these Amp 1 colonies, 40 were 
chosen at random, as a test sample, and were plated on both tetracycline- 
and chloramphenicol-containing agar plates. All 40 of these test colonies 
grew on these plates as well, indicating that they were also Tet r and Cm r . 
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Hence. it is concluded that at least these 40 colonies received an intact 
construct in which aU of the Tet', Cm' and Amp' genes were intact and 
fully expressible. Some of the colonies were tested further by PCR, as 
indicated above. The PCR bands that appeared were of the expected sizes. 

In the above example, the ligation and fill-in steps have been used for the 
sake of completeness and to illustrate possible alternative procedures. 
However, as wiU be easily understood by the skilled person, said ligation 
and fill-in steps are not necessary, and the procedure exemplified above 
can be carried out without using such steps. Examples of such procedures 
without said steps have not been given, for the sake of brevity. 

Example 5 

Assembly of 8 DNA Fragments into a Circularized Plasmid Usinff 

UDG-created Overhangs 
The plasmid to be constructed was designed to consist of eight fragments, 
each to be prepared separately by PCR amplification and then joined in a 
specific directional fashion to provide a circularized plasmid as the end- 
product. For the purposes of exemplifying the method of the present 
invention, it was chosen to combine eight PCR fragments which together 
encompass four regions, as detailed below, thus carrying out a specific 
directional connection of eight independent PCR fragments to form a 
single active plasmid construct, and this by an essentially one-step 
procedure in accordance with the present invention, a result never 
obtained or attempted in the prior art. 

This example is similar to Example 1, as the overhangs are created by the 
use of dU in the primer and the enzyme UDG to expose the overhang. 
However, in the present example, only one dU residue per primer is used. 
Therefore, the procedure of exposing the overhang comprises an additional 
reagent, N.N Dimethyl-ethylenediamine. The use of this reagent is 
explained in further detail in the section of general procedures above and 
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in the above mentioned copending patent application, identified as 
Attorney Docket 4149/96, Israeli Patent Application No. 120337. In 
addition, the present example uses primers that contain, at their 5' end, 
the natural gene sequence of the fragment to which they are to be joined. 
The junction is therefore seamless: no unnecessary residues are added at 
the junctions. In contrast, Example 1 uses primers that contain at their 5' 
end an irrelevant sequence whose only purpose is to serve as an overhang, 
since several dU residues must be used in the 5' part of the primer 
sequence. This irrelevant sequence is then introduced into the resulting 
fragment between the natural gene sequences of the fragments that are 
joined. Thus, the use of only one dU nucleotide, as in the present example, 
enables the seamless joining of fragments. 

In Fig. 13, there is shown schematically the plasmid that was designed 
and produced by the method of the present invention. This plasmid 
carries four independent antibiotic resistance genes, for resistance to 
Ampicillin (Amp r gene, or hereinafter Amp) ; Tetracycline (Tet r gene, or 
hereinafter Tet); Chloramphenicol (Cm r gene, or hereinafter Cm) and 

Kanamycin (Kn r gene, or hereinafter Kn). The plasmid also carries the 
ColEl origin of replication (ColEl-ori), which in this specific instance is 

situated next to the Amp r gene. Hence, such a plasmid is capable of being 
replicated in a host cell and will endow the host cell with resistance to all 
four types of antibiotic. 

Accordingly, it is possible to readily select for those host cells transformed 
by this plasmid by growing the transform cells in the presence of one of 
the antibiotics and then to screen for the resistance to the other 
antibiotics. In order to verify that mis-connections did not occur, 
additional verification tests may be carried out. One such test carried out 
included testing transformed colonies by restriction enzyme analysis. 
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Plasmid DNA was prepared from a number of transformed colonies and 
checked by restriction enzymes. 

For this eight fragments construction, the following was carried out: 

(i) Preparation of the specific primers for the PCR amplification . 

As shown in Fig. 13, and as outlined above, it was desired to construct a 
circularized plasmid having four regions: 

(a) a Tet 1 ' region. 

(b) an Amp 1, + ColEl-ori region. 

(c) a Cm 1 region. 

(d) a Kn r region. 

The order of the connection is depicted in figure 13. 



To achieve this task, eight separate fragments were designed: 



Name of fragment 
AmpA 

AmpB 27342 



CmA 40122 
CmB 



TetB 



KnA 31254 
KnB 



241365 



241366 



27343 
25596 



4144 



TetA 27341 
30402 



36176 



25595 



25953 
25952 



31253 



3885 



Size(bp) 


plasmid 


Location 


574 


PBR322 


3603-4159 


1171 


PBR322 


2460-3624 


481 


PACYC1 


4021-240 


293 


PACYC1 


3768-4043 


862 


PBR322 


1-848 


631 


PBR322 


827-1449 


569 


PACYC1 


1809-2368 


652 


PACYC1 


2349-2991 
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Written are the names of the fragments, their sizes, the plasmid from 
which they were PCR amplified, the exact location on the original plasmid 
of the site to be amplified and the numbers of the primers that were used 
to amplify the fragments. The full sequence and maps of the various 
regions of these plasmids are well known and can be accessed from 
GenBank database under accession Nos. J01749 (pBR322), X06403 
(pACYCI84) and X06402 (pACYC177). 

The AmpB fragment includes the 5' part of the Amp fragment and the 
ColEl-ori sequence. The AmpA fragment includes the 3' part of the Amp 
fragment. The CmA fragment includes the 5' part of the Cm fragment and 
the CmB fragment includes the 3' part of the Cm fragment. The TetB 
fragment includes the 5' part of the Tet fragment and the TetA fragment 
includes the 3' part of the Tet fragment. The KnA fragment includes the 5 ! 
part of the Kn fragment and the KnB fragment includes the 3 1 part of the 
Kn fragment. 

Each primer consists of two regions: a 3' region complementary to the DNA 
to be amplified, and a 5' region complementary to the fragment it should 
be connected to. 

Using standard automated procedures to produce polynucleotide oligomers 
(Applied Biosystems, U.S A..), the following primers were synthesized: 

Primers for the amplification of the AmpA fragments 

primer SEQ. ID NO. 35, also designated 241365: 
internal Amp 1 region 
ATTGCTGCAGGCATCGTGGTGUCA 

* 

primer SEQ. ID NO. 36, also designated. 3885: 

Cm r region Amp r region 

AGCGTTGGGTCCTGGCCA - AAGAGTAUGAGTATTCAA 
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Primers for the amplification of the AmpB fragments 

primer SEQ. ID NO. 37, also designated 27342: 

Cm 1 region Amp r region 

ACGCCTG - AACGCAGGAAAGAACAUGTG 



primer SEQ. ID NO. 38, also designated 241366: 

internal Amp r region 

ACACCACGATGCCTGCAGCAAUGG 

* 

Primers for the amplification of the CmA fragments 

primer SEQ. ID NO. 39, also designated 40122: 

Kn r region Cm' region 

AGG CCT GGT ATG AGT C - TCA GGA GCU AAG GAA GCT AAA ATG 

* 

primer SEQ. ID NO. 40, also designated 27343: 

internal Cm r region 
ATTGGCTGAGACGAAAAACATAUTCTC 

Primers for the amplification of the CmB fragment 

primer SEQ. ID NO. 41, also designated 25596: 

internal Cm' region 
ATATGTTTTTCGTCTCAGCCAAUCC 

primer SEQ. ID NO. 42, also designated 4144: 

Amp r region Cm r region 

ATGTTCTTTCCTGCGTT - CAGGCGUTTAAGGGCACCAATAAC 

* 

Primers for the amplification of the TetA fragment: 

primer SEQ. ID NO. 43, also designated 27341: 

internal Tet' region 
ATACCGCAAGCGACAGGCCGAUCATCG 
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primer SEQ. ID NO. 44, also designated 36176: 

Kn antisense Tet sense 

ACGTGGCTTTGTTG - TTCTCATGUTTGACAGCTTATC 

* 

Primers for the amplification of the TetB fragment: 

primer SEQ. ID NO. 45, also designated 30402: 

Amp r region Tet' region 

ATACTCTT - TGGCCAGGACCCAACGCUGCCC 

* 

primer SEQ. ID NO. 46, also designated 25595: 

internal Tet 1 region 
ATCGGCCTGTCGCTTGCGGTAUTCG 

* 

Primers for the amplification of the KnA fragment: 

primer SEQ. ID NO. 47, also designated 31254: 

Tet region Kn region 

ACATGAGAA - CAACAAAG CCACGUTGTGTCTC 

* 

primer SEQ. ID NO. 48, also designated 25953: 

internal Kn region 
AGACGAAATACGCGATCGCUGTTAA 

* 

Primers for the amplification of the KnB fragment: 

primer SEQ. ID NO. 49, also designated 25952: 

internal Kn region 
AGCGATCGCGTATTTCGTCUCGCTC 

primer SEQ. ID NO. 50, also designated 31253: 

Cm region Kn region 

AGCTCCTGA • GACTCATACCAGGCCUGAATCG 

★ 

(ii) Preparation of the PCR fragments 

PCR reactions were carried out as in the other examples above. Following 
PCR synthesis of the individual fragments, each fragment was purified by 
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standard agarose-gel purification techniques using the commercially 
available Bio-Rad "Prep-A-Gene™" DNA purification kit and adhering to 
the manufacturer's instructions. Following purification, the concentration 
of the purified fragment DNA was determined by standard procedures 
using the Pharmacia "Gene-Quant™ RNA/DNA Calculator" and adhering 
to the manufacturer's instructions. 

(iii) Connection of the PCR-produced fragments 

The eight PCR fragments as produced and purified according to the above- 
mentioned procedure, were connected to each other in a one-step reaction 
mixture in a single reaction vessel. This was achieved by mixing the 
fragments together in a 25|il reaction mixture that included: 0.15 pmol of 
each fragment, 2.5^1 buffer (200mM Tris-HCl pH 8.4, 500 mM KC1, 15mM 
MgCh), 2.5 jal of 1M N,N-Dimethyl-ethylenediamine and 6.25 units of 
UDG (GibcoBRL). The mixture was incubated at 37°C for 4 hours and 
then transferred to 70°C for 5 minutes to facilitate dissociation of the short 
nicked single-stranded DNA from the 5 ! ends of the fragments. After 
dissociation the short nicked single-stranded DNA were removed using 
"QIAquick PCR purification kit" (QIAGEN) adhering to the manufacturer's 
instructions. Before adding the first buffer of the kit, 200 \il of hot (70°C) 
buffer (20mM Tris-HCl pH 8.4, 50 mM KC1, 1.5xnM MgCl 2 ) was added in 
order to minimize the reannealing of the short nicked single-stranded 
DNA. The DNA was eluted in 30|il of double-distilled water (DDW). 27\d of 
the DNA was incubated with 3^1 of buffer (20mM Tris-HCl pH 8.4, 50 mM 
KC1, 1.5mM MgCb) in a water-bath, at 70°C. The bath was shut down and 
the tempex-ature was slowly decreased to 37°C. This allows for the joining 
between the complementary overhangs, while minimizing illegitimate 
connection. 



WO 98/38298 



PCT/IL98/00096 



-52- 

(iv) Analysis of the efficiency of joining of the fragments 

A 1 \xl sample of the above DNA, containing the newly constructed 
plasmid, made from joining the 8 separate fragments, was used to 
transform E. coli DH10B cells by electroporation. In the transformation 
procedure 20 of electrocompetent "ElectroMax" cells (purchased from 
Gibco BRL) were mixed with the above DNA sample and subjected to 
electroporation in a commercially available apparatus (BioRad "E. coli 
Pulser Apparatus" set at 1.8 kV and operated according to the 
manufacturer's instructions). 

Following electroporation (transformation) the cells were plated on LB 
Agar plate containing 100 mM Ampicillin (to select for transformants 
having Ampicillin resistance by virtue of having being transformed with a 

DNA carrying the Amp r gene). On the day after, three colonies were picked 
and checked for resistance to Chloramphenicol, Tetracycline and 
Kanamycin by plating on Agar plates containing the appropriate 
antibiotics. 

The results showed that the three colonies were resistant to all the four 
antibiotics. The colonies were further checked by restriction enzyme 
analysis and proved to be correct (data not shown). 

The above result is very significant since it shows that it is possible to join 
correctly 8 separate DNA fragments in a specific directional manner, in a 
single reaction mixture by an essentially one-step procedure. 

All the above description of preferred embodiments and examples have 
been provided for the purpose of illustration, and are not meant to limit 
the invention. Many modifications can be made in the methods, materials 
and conditions, and many different products and results can be obtained, 
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using different numbers of overhangs and different numbers of fragments 
to be joined, all without exceeding the scope of the invention. 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Gil Sharon and Gesher Israel Advanced 

Biotecs (1996) Ltd. 

(B) STREET: POB 98 

(C) CITY: Beit Neqofa 

(E) COUNTRY: Israel 

(F) POSTAL CODE (ZIP): 90830 

(G) TELEPHONE: +972 2 5700 381 
(K) TELEFAX: +972 2 5700 586 



(ii) TITLE OF INVENTION: Improved DNA Assembly Method 
(iii) NUMBER OF SEQUENCES: 50 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
AGCTCCTGAT TCTCATGTTT GACAGCTTAT C 31 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CUACUACUAC UATGGCCAGG ACCCAACGCT GCCCGAG 37 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 0 base pairs 



WO 98/38298 



PCT/BL98/00096 



-55- 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CUACUACUAC UACUATGGCC AGGACCCAAC GCTGCCCGAG 40 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CUACUACUAC UACUACUATG GCCAGGACCC AACGCTGCCC GAG 43 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CUACUACUAC UACUACUACU ATGGCCAGGA CCCAACGCTG CCCGAG 4 6 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CUACUACUAC UACUACUACU ACUACUACUA TGGCCAGGAC CCAACGCTGC CCGAG 55 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: unlcnown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ATTGGTGCCC TTAAACGCCT GAACGCAGGA AAGAACATGT G 41 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
UAGUAGUAGU AGAAGAGTAT GAGTATTCAA CATTTC 36 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
UAGUAGUAGU AGUAGAAGAG TATGAGTATT CAACATTTC 39 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
UAGUAGUAGU AGUAGUAGAA GAGTATGAGT ATTCAACATT TC 42 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
UAGUAGUAGU AGUAGUAGUA GAAGAGTATG AGTATTCAAC ATTTC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
UAGUAGUAGU AGUAGUAGUA GUAGUAGUAG AAGAGTATGA GTATTCAACA TTTC 54 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AUCAUAUCAU AUTTCTCATG TTTGACAGCT TATCATC 37 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAUCAUCAUC AUCAGGCGTT TAAGGGCACC AATAACTG 38 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
UACUAUACUA UACATGCCGG CGATAATGGC CTGCTTC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
UAUAGUAUAG UAGTAGATGA CGACCATCAG GGACAGC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AUCUAAUCUA AUTACGGTGA AAACCTGGCC TATTTCC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AUGAUGAUGA . UGAACGCAGG AAAGAACATG TGAGCA 
E2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAUAUCAUAU CAUAUCAUAU TTCTCATGTT TGACAGCTTA TCATC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CAUCAUCAUC AUCAUCAUCA UCAGGCGTTT AAGGGCACCA ATAACTG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CUAUACUAUA CUAUACUAUA CATGCCGGCG ATAATGGCCT GCTTC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
UAUAGUAUAG UAUAGUAUAG GTAGATGACG ACCATCAGGG ACAGC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
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CUAAUCUAAU CUAAUCUAAU TACGGTGAAA ACCTGGCCTA TTTCC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
AUGAUGAUGA UGAUGAUGAU GAACGCAGGA AAGAACATGT GAGCA 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AUAUGAUAUG AUTCAGGAGC TAAGGAAGCT AAAATGG 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AUUAGAUUAG AUCAGGCGGG CAAGAATGTG AATAAAG 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AUAUGAUAUG AUAUGAUAUG TCAGGAGCTA AGGAAGCTAA AATGG 
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
AUUAGAUUAG AUUAGAUUAG CAGGCGGGCA AGAATGTGAA TAAAG 4 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 
"(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
ATTGGTGCCC TTAAACGCCT GAACGCAGGA AAGAACATGT G 41 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i.) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AGCGTTGGGT CCTGGCCAAA GAGTATGAGT ATTCAACA 38 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AGCTCCTGAT TCTCATGTTT GACAGCTTAT C 



31 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ATACTCTTTG GCCAGGACCC AACGCTGCCC 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
AAACATGAGA ATCAGGAGCT AAGGAAGCTA AAATG 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
ATGTTCTTTC CTGCGTTCAG GCGTTTAAGG GCACCAATAA C 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
ATTGCTGCAG GCATCGTGGT GUCA 
(2) INFORMATION FOR SEQ ID NO: 36: 



WO 98/38298 



-63- 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
<B) TYPE: nucleic acid 
(CJ STRANDEDNESS : single 
(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 
AGCGTTGGGT CCTGGCCAAA GAGTAUGAGT ATTCAA 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
ACGCCTGAAC GCAGGAAAGA ACAUGTG 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
ACACCACGAT GCCTGCAGCA AUGG 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
AGGCCTGGTA TGAGTCTCAG GAGCUAAGGA AGCTAAAATG 
(2) INFORMATION FOR SEQ ID NO: 40: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
ATTGGCTGAG ACGAAAAACA TAUTCTC 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
ATATGTTTTT CGTCTCAGCC AAUCC 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 
ATGTTCTTTC CTGCGTTCAG GCGUTTAAGG GCACCAATAA C 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
ATACCGCAAG CGACAGGCCG AUCATCG 
(2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 
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( A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



ACGTGGCTTT GTTGTTCTCA TGUTTGACAG CTTATC 
(2) INFORMATION FOR SEQ ID NO: 45: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
ATACTCTTTG GCCAGGACCC AACGCUGCCC 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
ATCGGCCTGT CGCTTGCGGT AUTCG 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 



ACATGAGAAC AACAAAGCCA CGUTGTGTCT C 



(2) INFORMATION FOR SEQ ID NO: 48: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
AGACGAAATA CGCGATCGCU GTTAA 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
AGCGATCGCG TATTTCGTCU CGCTC 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
AGCTCCTGAG ACTCATACCA GGCCUGAATC G 



31 
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CLAIMS : 

1. A method for assembling two or more DNA fragments with high 
efficiency, comprising: 

a) providing, for each DNA fragment, at least one protruding 
terminus, or "overhang", capable of hydrogen bonding to a complementary 
sequence on at least one strand of a second DNA fragment, said overhang 
having at least 15 bases; and 

b) mixing two or more said DNA fragments under conditions 
suitable to promote joining thereof. 

2. A method according to claim 1, wherein three or more fragments are 
joined, and the molar ratio of each DNA pair is between about 1:1 and 
1:50. 

3. A method according to claim 1 or 2, wherein the overhang has at least 
20 bases. 

4. A method according to claim 1 or 2, wherein the number of bases in the 
overhang is between about 20 and about 30. 

5. A method according to any one of claims 1 to 4, wherein the number of 
fragments joined simultaneously is three or more. 

6. A method according to any one of claims 1 to 5, essentially as described 
and with particular reference to the examples. 

7. A DNA construct made by joining two or more DNA fragments each 
comprising at least one protruding terminus, or "overhang", capable of 
hydrogen bonding to a complementary sequence on at least one strand 
of a second DNA fragment. 
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8. A DNA construct, whenever prepared by the method of any one of 
claims 1 to 6. 

9. A DNA fragment comprising an overhang of at least 15 nucleotides or 
an end portion suitable to be converted into such an overhang. 

10. A DNA fragment as claimed in claim 9, for use in the method of claim 
1. 
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